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ABSTRACT 

The Northern HIPASS catalogue (NHICA T) is the northern extension of the 
HIPASS catalogue, HICAT ilMover el alJl2004^ . This extension adds the sky area 
between the declination range of +2° < 5 < +25°30' to HICAT's declination range of 
—90° < S < +2°. HIPASS is a blind Hi survey using the Parkes Radio Telescope cover- 
ing 71% of the sky (including this northern extension) and a heliocentric velocity range 
of -1,280 km s" 1 to 12,700 km s" 1 . The entire Virgo Cluster region has been observed 
in the Northern HIPASS. The galaxy catalogue, NHICAT, contains 1002 sources with 
Uhoi > 300 km s -1 . Sources with —300 km s _1 < «h c i < 300 km s _1 were excluded 
to avoid contamination by Galactic emission. In total, the entire HIPASS survey has 
found 531 7 galaxies identified purely by their HI content. The full galaxy catalogue is 
publicly-available at ihttp : //hipa s~aus-vo . org} . 
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1 INTRODUCTION 

The Hi Parkes All-Sky Survey (HIPASS) survey is a blind Hi 
survey using the Parkes Radio Telescope 1 , and the Northern 
extension increases this survey by a further 37 percent in sky 
area. The primary objective of extending Southern HIPASS 
to the north is to complement the southern census of gas-rich 
galaxies in the local Universe. 

The Hi mass function, HIMF, JZwaan et alJl2003l) and 
galaxy two-point correlation functkm~ ijMeve^^tH!uTl2005h 
based on Southern HIPASS showed that the statistical mea- 
sures of the galaxy populati on from HIPASS a re limited 
by cosmic variance. Recently. IZwaan et alJ IMPORT) used HI- 
CAT to investigate the effects of the local galaxy density 
on the HIMF. Using the n-th nearest neighbour statistic, 
they found tentative evidence that the low-mass end of the 
HIMF becomes steeper in higher density regions. These au- 
thors were able to examine the trend in the slope of the 
HIMF for different values of n in the statistic. Larger values 
of n correspond to sampling the density on larger scales. For 
each value up to n — 10, the slope became systematically 
steeper as the density increased. Thus it appears that the Hi 
properties of galaxies might be influenced by environmental 
effects on quite large scales (where a typical separation of the 
n = 10 nearest neighbours is ~ 5 Mpc), in addition to the 
well-known local effects, such as ti dal interactions between 
neighb ouring galaxies. Previously, iRosenberg fc Schneider! 
(2002) found a ~ —1.2 and a « —1.5 for the slope of the 
HIMF in the immediat e and field regions of t he Virgo Clus- 
ter, respectively. Also. ISoringob et alJ l)2005lL found natter 
slopes to the low mass end of the HIMF in higher density 
regions. However their galaxy sample was selected optically. 
Since Northern HIPASS covers the entire Virgo Cluster re- 
gion, the Northern HIPASS catalogue (NHICAT) can be 
used in conjunction with the Southern HIPASS catalogue 
(HICAT) to explore these trends and investigate the effects 
of cosmic variance on HIPASS galaxy catalogue statistics. 

It is worth noting that the Northern HIPASS also pro- 
vides the first blind Hi survey of the entire region in and 
around the Virgo Cluster. Assuming a Virgo distance of 16 
Mpc and an integrated flux limit of 15 Jy km s _1 this corre- 
sponds to a mass sensitivity of 9 x 10 8 Mq . Thus the survey 
will detect any Hi clouds above this mass limit in the vicinity 
of the Virgo Cluster, regardless of stellar content. 

The Virgo Cluster provides a nearby example of pro- 
cesses that are more common at higher redshifts, such as 
galaxy-galaxy and galaxy-intracluster medium interactions. 
Northern HIPASS will be used to investigate the role of Hi 
in a cluster environment in individual galaxies as well as sta- 
tistically across the whole cluster. Und erstanding the role o f 
Hi is vital for galaxy evolution models. iKennev et al.l (120041) 
found 6 galaxies in the Virgo Cluster sho wing distorted 
Hi m orphology. Using N-body simulations, IVollmer et alJ 
teOO ll) investigated the effect of ram pressure stripping in 
the Virgo Cluster and found that Hi deficiency is dependent 
on galaxy orbits within the cluster. They concluded that all 
the galaxies showing some form of distorted Hi distribution 



1 The Parkes telescope is part of the Australia Telescope which 
is funded by the Commonwealth of Australia for operation as a 
National Facility managed by CSIRO. 



have already passed through the centre of the cluster and 
are not infalling for the first time. 

The catalogue of extragalactic Hi s ources from H IPASS 
was named HICAT and was presented in lMever et al.l ll2004) 
(hereafter known as MZ04), while t he completen e ss an d 
reliability of HICAT was assessed by IZwaan et al] J2004) . 
Here we present a catalogue of extragalactic Hi sources from 
Northern HIPASS, named NHICAT. The basic parameters 
of HIPASS, Northern HIPASS, HICAT and NHICAT are 
given in Table Q Apart from the declination coverage, the 
main difference between the two surveys and catalogues 
is the higher noise level in Northern HIPASS. For a full 
summary of parameters of existing blind Hi surveys, in- 
cludin g subsets of HIPASS ( HIPASS Bright Galaxy Cata- 



logue, Kori^alski^^lJ 



alogue^ TKiiborn^t^dT 112002 



the South Celestial Cap cat- 
see Table 1 of MZ04. The 
No rthern HIPASS Zon e of Avoidance (NHIZOA) survey 
bv lDonlev et alJ ]2005t) covers northern declinations of the 
Galaxy - a subset of the Northern HIPASS area - at a higher 
sensitivity (RMS = 6 mjy beam -1 ). 

Optical identification of NHI CAT sources will use sim- 
ilar techniques to HICAT foovle et al] l2005h and will be 
presented in a later paper. With a total spatial cov- 
erage of 29,343 square degrees and 5317 sources, the 
combined HICAT and NHICAT catalogue is the largest 
purely Hl-selected galaxy catalogue to date. The Arecibo 
L-Band Feed Array (ALFALFA) surveys will eventu- 
ally cover the same region of sky as Northern HIPASS 
and will extend up to a declination of +36°. More 
information about the progress of ALFALFA can be 
found online at (http: //egg. astro . Cornell . edu/alf alf a > 
JCiovanelli et al]l2005h . 

In this paper we present NHICAT, together with the 
completeness and reliability analysis of the catalogue. Sec- 
tion 2 reviews Northern HIPASS and its properties. The 
source identification and the generation of the catalogue 
is described in Section 12.11 Section 12.21 discusses the noise 
statistics of Northern HIPASS and the completeness of NHI- 
CAT is analysed in Section 3.1. The narrowband follow-up 
observations and the reliability of NHICAT will be described 
in Section 3.2. 



2 NORTHERN HIPASS DATA 

Northern HIPASS was designed to cover all RAs in the 
region between declinations +2° < 5 < +25°. Observa- 
tions were undertaken using the 21-cm Multibeam receiver 
iStavelev-Smith et al] 119961) on the Parkes radio telescope 
during the period from 2000 to 2002. Observations were 
made by scanning in 8° strips of sky with 7 arcminutes in RA 
of separation between scans. A 1024 channel configuration 
covering a 64 MHz bandwidth was used in the Multibeam 
correlator to give a channel separation of Av = 13.2 km s _1 
across the heliocentric velocity range of -1,280 km s _1 to 
12,700 km s _1 . The observation and reduction methods are 
exactly the same as Souther n HIPASS and can be found in 
detail in lBarnes et al] (l200lTt . The northern dataset consists 
of 102 8° x 8° cubes and 48 8° x 7° cubes in the northernmost 
declination band. 

The catalogue includes sources in the range +25° < 8 < 
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Table 1. Survey and catalogue parameters. 



Survey name Survey range RMS Catalogue Name Catalogue range Sources 

{deg, km s — 1 ) (mJy beam -1 ) (deg, km s — 1 ) 

HIPASS 8 < +2°, 13 HICAT 8 < +2°, 4315 

(MZ04) -1280 < v < 12700 300 < v < 12700 

Northern extension +2° < 8 < +26°, 14 NHICAT +2° < 8 < +25° 30', 1002* 

HIPASS, this work -1280 < v < 12700 300 < v < 12700 



* Note that 1 source was found slightly below declination +2°. 

+25° 30'. At this declination, the telescope field of view is 
increased, though the sensitivity is significantly decreased. 

2.1 Northern HIPASS Catalogue (NHICAT) 

NHICAT was generated using essentially the same method 
as HICAT, with some improvements in efficiency. An up- 
dated version of the TopHat finder algorithm (see MZ04 
for details of the original TopHat) was used to identify 
sources. The updated TopHat finder is very effective at fil- 
tering false detections which have narrow velocity widths. 
Velocity widths with a FWHM of less than 30 km s _1 were 
considered to be too narrow to be extragalactic sources. Such 
narrow velocity width detections are usually associated with 
hydrogen recombination frequencies or known interference 
lines. The consistency of the updated finder was tested by 
comparing the output of the original and updated finder 
for the southern cubes. The updated finder returned exactly 
the same sources as the original version without the narrow 
velocity width detections. Two galaxy finders were used to 
generate HICAT: 'MULTIFIND' and TopHat. The 'MUL- 
TIFIND' finder uses a peak flux threshold method, whereas 
the TopHat algorithm involves the cross-correlation of spec- 
tra with tophat profiles at various scales (MZ04) . Although 
the original version of TopHat was reported to find ~90% of 
final catalogue sources in southern HICAT (MZ04), further 
tests showed that all sources with Speak > 100 mjy were 
recovered by the updated TopHat finder. Since 'MULTI- 
FIND' generated many more false detections in the north- 
ern data due to the different level and character of noise 
(see section 2.2), we decided to use the updated TopHat 
finder only. The TopHat finder was found to be quite ro- 
bust against the increased level of noise and baseline ripple. 
In addition, since the updated TopHat finder is much more 
effective, separate radio-frequency intereference (RFI) and 
recombination line removal (as described in MZ04) was not 
necessary. 

Although the known narrowband RFI have been filtered 
out in the process described above, not all the known RFI 
have been removed from the data. Not only is the Sun (and 
the reflections of the Sun) the strongest source of interfer- 
ence but it also provides broadband interference which could 
not be easily filtered out. This solar interference produces a 
standing wave effect (known as 'solar ripple' ) in our data 
which in turn will affect the effectiveness of galaxy detec- 
tion. These standing waves are likely to be worse at lower 
elevations due to the ground reflection of the Sun. 

NHICAT was constructed by first running the TopHat 



galaxy finder on all cubes, excluding the region —300 < 
Whel < 300 km s to minimise confusion with Galactic emis- 
sion. 14,879 galaxy candidates were found. Each candidate 
source was then checked manually by simultaneously dis- 
playing the source in four ways: a spectrum, an integrated 
intensity map, a declination-velocity projection and a Right 
Ascension-velocity projection. A candidate source was ac- 
cepted if it had a spectral profile which was easily distin- 
guishable from the noisy baseline and a position-velocity 
profile which was wider than 2 pixels across the position 
axes. On the other hand, a candidate source was rejected if 
either its spectral profile was not distinguishable from the 
noise or if its position-velocity projections showed a distinct 
signal which was exactly 2 pixels wide, as these sources are 
usually the result of interference in the data. All the ac- 
cepted sources were then passed to a semi-interactive param- 
eter finder. Sources were flagged during parameter finding 
as either 1, 2, 3, 4 or a high-velocity cloud (HVC) detec- 
tion: Flagyl represented a definite source detection, 2 rep- 
resented source detection with less certainty, 3 represented 
a source detected on the edge of a cube and 4 represented a 
non-detection. The Flag=4 option is provided to the param- 
eter finder in the case of misclassification of a source dur- 
ing the checking stage. It should be noted that a significant 
number of HVCs were detected in Northern HIPASS at RA 
~ 23 hours due to the Magellanic stream. The Magellanic 
Str eam from the Norther n HIPASS data has been presented 
bv IPutman et ai] (j2003). Source lists from all cubes were 
then merged and duplicates were removed. The process of 
merging matched every Flag=3 source with the detection 
of the same source in the neighbouring cube with the same 
overlap region. The procedures for the manual checking and 
parametrisation used are exactly the same as MZ04. More 
detailed descriptions of these procedures can be found in 
sections 3.2 and 3.3 of MZ04. Bivariate distributions of the 
basic properties (heliocentric velocity, velocity width, peak 
flux and integrated flux) of the sources detected in Northern 
HIPASS are shown in FigureQ Single parameter histograms 
are shown on the diagonal of this Figure. 

The cubes were then re-checked for missed sources us- 
ing a semi-interactive process. Sources detected in each 
cube were marked with a cross on an integrated intensity 
map. These maps were then manually checked for unmarked 
sources. From these manual checks, 15 additional sources 
were detected. The galaxy finder missed sources which were 
close to several other sources. 

Two sources may also have been identified as the same 
source. Such sources can be differentiated by inspecting the 
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spectra and the spatial moment maps since separate sources 
will not overlap both spatially and spectrally. These partic- 
ular sources were flagged as 'confused'. There are two 'con- 
fused' sources in NHICAT. 



Extended sources were identified and fitted in the same 
fashion as in MZ04 (see section 3.5 of MZ04 for more de- 
tails). All sources greater than 7' in size were identified as 
potentially extended sources. The integrated flux limit cor- 
responding to a fixed source size can be determined using 



the relationship between integrated flux Smt, and source di- 
ameter, 



Sint = / SdV « 1.26 



Hi 



(1) 



An explanation and derivation of Equation^can be found in 
MZ04. The Smt limit corresponding to a source size greater 
than 7' is 57 Jy km s _1 .In total, 41 candidates were found 
to have a measured flux greater than this limit. The moment 
maps of each of these sources were then examined and two 
sources were found to be extended. 

All the cube velocities are heliocentric and use the opti- 
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Figure 2. Skymap of detections found in Northern HIPASS. The 
lines mark increasing declinations inwards where the centre is 
the north pole. The radial divisions show increasing RA in an 
anti-clockwise direction starting with hours at the top of the 
diagram. The dotted lines mark lines of Galactic latitude, b. 



cal definition where velocity,!) = c(/ //)// where c, /„ and 
/ represent the speed of light, rest and observed frequency 
respectively. The sources were th en assigned names accord- 
ing to the convention of HICAT iMever et alJl2004l) . 

The final stage of the processing involved checking the 
19 sources located at <5 < +2°. Of these, 18 sources have al- 
ready been catalogued in HICAT showing that the datasets 
and analysis techniques are consistent between the two sur- 
veys at the 95% level. The extra source is located only 
slightly south of 8 — +2° and has been retained in NHI- 
CAT so that the final combined NHICAT and HICAT cata- 
logue will be a complete catalogue of the southern skies up 
to 8 = +25°30'. 

Northern HIPASS covers 7997 square degrees of sky and 
1002 galaxies were found from their Hi content in this re- 
gion. The parameters provided with the catalogue are the 
same as the parameters given in HICAT. Detailed descrip- 
tions of each parameter can be found in Table 4 of MZ04. 
Figure [5] shows the spatial distribution of all the sources 
found in NHICAT. Note that the cluster of sources at RA, 
a ~ 12 hours is the Virgo cluster. Galaxies are also found 
at low Galactic latitudes - a region often avoided by opti- 
cal ga laxy surveys. As me ntioned in the introduction, NHI- 
ZOA jDonlev et alJ feOOoD ) covers a subset of the Northern 
HIPASS survey region. We re-identify 23 galaxies from the 
NHIZOA catalogue in the overlapping regions. The RMS of 
NHIZOA is 2.33 times less than that of the observations in 
Northern HIPASS. Assuming that the detected number of 
galaxies is based solely on the sensitivity of the observations, 
one would expect Northern HIPASS to find only 33 of the 77 
NHIZOA sources. There are fewer matches between North- 
ern HIPASS and NHIZOA because the source detection rate 
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Figure 3. Peak-normalised distributions of pixel flux of entire 
cubes in HIPASS. The distribution of the pixel flux from a typical 
northern cube (cube number 538) is represented by crosses and 
the distribution marked by asterisks represent the pixel flux from 
a typical southern cube (cube number 194). A parabola (solid 
line) is shown to compare these distributions with Gaussian noise 
statistics. 



is not fully described by the sensitivity (as explained in the 
following section). 

In accordance with HICAT conventions, NHICAT only 
included detections with Vhel > 300 km s~ . One galaxy, 
HIPASSJ1213+14a, was found to have a mean Hi v^d of 
-222.7 kms" 1 , S peak = 0.181 Jy and S int = 42.9 Jy 
km s — l This galaxy was detected because its velocity width 
extended into the heliocentric velocity range where «hci < 
-300 km s _1 in summary, no galaxies with v^ c i < —300 
km s -1 were found in NHICAT. The number density of the 
sources found in Northern HIPASS was approximately 0.13 
sources per square degree of sky. In comparison, 0.20 sources 
per square degree of sky were found in Southern HIPASS. 
The cause of this difference will be discussed later in this 
paper. 



2.2 Noise characteristics 

Fewer sources were detected in NHICAT at higher decli- 
nations than in HICAT, as can be seen in Table (which 
shows the number density in NHICAT and HICAT). Al- 
though some deviation is expected due to cosmic variance, 
the most likely cause of this density difference is the higher 
level of noise in Northern HIPASS. The variation in gain 
and system temperature (T sys ) of the telescope with re- 
spect to elevation are insufficient to account for the higher 
noise observed in the northern survey. The level of noise in 
Northern HIPASS is greater than in Southern HIPASS be- 
cause the Parkes radio telescope observes northern sources 
at lower sky elevations. This results in the telescope gather- 
ing a greater amount of interference from ground reflections 
and the Sun. As the most northerly areas of the survey were 
only observable during a short LST window, sidelobe solar 
interference was often unavoidable. 

As shown in Table the RMS of both Northern and 
Southern HIPASS are very similar. However, the Northern 
HIPASS cubes appeared much 'noisier' in the flux density 
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Table 2. Comparison of source number density in NHICAT and HICAT. Note that 1 source in NHICAT was detected below 2° in 
declination. 



Catalogue 


Declination range 


Area of sky (sq degs) 


No. of sources 


Number density (per sq degs) 


HICAT 


-90° < 8 < +2° 


21,346 


4,315 


0.20 


NHICAT 


+2° < 8 < +10° 


2,862 


413 


0.14 




+10° ^ <5 < +18° 


2,792 


364 


0.13 




+18° ^ <5 < +25° 


2,343 


224 


0.10 




60 120 180 240 300 

Upper 1 -percentile noise [mjy] 

Figure 4. Skymap of the 1-percentile pixel flux map of both 
Northern and Southern HIPASS. The south pole (8 = -90°) is in 
the centre and RA increases in an anti-clockwise direction starting 
with hour at the top of the diagram. Observations through the 
Galaxy (where b = 0°) correspond to the darker horizontal band 
of cubes. The southern cube identification numbers range from 
1-388 and the northern cubes are 389-538. 



maps than that of the cubes in Southern HIPASS. Hence, 
the RMS method is not an effective way of illustrating why 
Northern HIPASS appeared 'noisier'. 

The aim of this section is to characterise the distribution 
of noise observed in Northern HIPASS cubes. One method 
is to examine the distribution of all the pixel flux values in 
a given cube and measure the 99-percentile value of this dis- 
tribution. This measure illustrates the noise characteristics 
in a given cube by measuring the extent of the outlying pixel 
flux values in the pixel distribution. Instead of characterising 
the noise in terms of the width of the flux distribution (as 
in the RMS method), we now examine the extent to which 
the outlier population is extended. 

Figure shows the peak-normalised pixel flux distribu- 
tions of 2 average cubes, one from Northern HIPASS (cube 
number 538) and the other from Southern HIPASS (cube 
number 194). Since Hi detections comprise very few pix- 
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0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 
Upper 1— percentile values (Jansky) 

Figure 5. Normalised histograms of 'noise' levels (1-percentile 
measures). The distribution of 'noise' levels in Southern HIPASS 
is represented by the solid line distribution. The dashed line dis- 
tribution represents the 'noise' levels in the declination band 
between +2° < 8 < +10° and the dotted line distribution 
represents the 'noise' levels in the declination bands between 
+10° < 8 < +26°. 

els compared with the entire cube, the Hi sources have not 
been removed from the plot. The excess of negative flux in 
Northern cubes (as seen in Figure is due to the band- 
pass removal and calibration method. Negative bandpass 
sidelobes occur at d eclinations either side of bright sources 
l|Barnes et alJl200li) . This means that bright interference is 
surrounded by negative artifacts, leading to an excess of 
both positive and negative pixels in the data with stronger 
interference. An ideal cube with only Gaussian noise would 
have a parabolic distribution (as shown by the solid line) 
since the natural log of a Gaussian distribution, exp(— x 2 ), is 
— x 2 . As can be seen in Figure^] the offset from the parabola 
is greater in the Northern cube than in the Southern cube, 
suggesting broader distributions of pixel noise values in the 
North. 

Thus the 'noise' level can now be characterised by mea- 
suring the extent of outliers using the 99-percentile rank of 
the pixel flux distribution. In actuality, the 1-percentile rank 
has been used to avoid bias caused by actual Hi sources in 
this measure. Figure [I] shows the skymap of the 1-percentile 
measures for both Northern HIPASS and Southern HIPASS. 
It should be noted that the 3 outer bands of declinations 
(cube numbers 389 - 538) form the regions observed in 
Northern HIPASS, and the inner declination bands (cube 
numbers 1 - 388) represent the observed regions in Southern 
HIPASS. 

It is qualitatively obvious from Figure 2] that observa- 
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tions through the Galaxy appear much 'noisier' than obser- 
vations away from the Galaxy. It can also be seen that the 
northernmost declination band in Northern HIPASS is much 
noisier than the rest of the HIPASS observations. 

A quantitative version of Figure [I] is as shown in Fig- 
urc|^]with three normalised distributions of cube 'noise' lev- 
els (characterised by 1-percentile rank). The cube 'noise' 
levels of Southern HIPASS are represented by the solid 
line distribution; the dashed and dotted line represent cube 
'noise' level distributions of Northern HIPASS. The cube 
'noise' level of the southernmost declination band in North- 
ern HIPASS is shown by the dashed line distribution and the 
cube 'noise' level of the rest of Northern HIPASS is shown 
by the dotted line distribution. The median 'noise' level for 
the solid line and dashed line distributions are at 42 mjy 
and 43 mjy respectively and the median 'noise' level for the 
dotted line distribution (northernmost HIPASS) is 55 mjy. 
By this measure, the noise level has increased by 31%. 

In conclusion, the 'noisiness' observed in the Northern 
HIPASS cube can be described quite accurately using the 1- 
percentile rank which measures the values of the pixel flux 
outliers in each cube. As mentioned before, the increase in 
the 1-percentile rank values for the northernmost declination 
band in Northern HIPASS is most likely a result of the in- 
crease in solar interference from increasingly lower elevation 
observations. 



3 COMPLETENESS AND RELIABILITY OF 
NHICAT 

The techniques used to calculate the completeness and re- 
liability in NHICAT are the same as the methods used for 
HICAT. Detailed descriptions of the methods used to anal- 
ys e the completeness and reliability of HICAT can be found 
in lZwaan etail j2004h . 

3.1 Completeness of NHICAT 

Synthetic sources were inserted into all the Northern 
HIPASS cubes before NHICAT was constructed in order to 
measure the completeness of the resulting catalogue. These 
synthetic sources were then extracted after the parame- 
ter finding process. In total, 774 non-extended synthetic 
sources were inserted into the Northern HIPASS cubes. 
These sources represent a random sample of sources ranging 
in 50% velocity width (W50) from 20 to 650 km s _1 , from 
0.02 to 0.13 Jy in peak flux (S p ) and from 300 to 10000 
km s _1 in heliocentric velocity («hci). 

The completeness of recovery can be easily estimated 
by measuring the fraction of fake sources (recovered in each 
parameter bin), D: 

D(S P , W) = N^ VCIcd (S p , W)/N iake (S p , W) (2) 

However, the completeness as a function of one parame- 
ter cannot be effectively measured solely using D. The com- 
pleteness, C, of NHICAT can be measured via the ratio of 
the number of detected real sources, N, over the true num- 
ber of sources in each bin. The true number of sources in 
each bin can be estimated by N/D. Since the number of 
sources in each bin differs and the parameter distribution of 



the fake sources may be different from the distribution of 
the real galaxies, this method cannot give a very good esti- 
mate of the completeness as a function of one parameter. A 
correction can be made by integrating over another param- 
eter and applying a weighting to account for the different 
number of sources in each bin. As an example, C(S p ) can 
be estimated by integrating over W : 

V% =0 N(S P ,W) 
° l,5pJ Z~ =0 N(S P ,W)/D(S P ,W) W 

Likewise, C(W) and C(Sint) have been calculated by 
integrating over S p . Figure HJ shows the completeness of 
NHICAT as functions of S p , W and Sint respectively where 
the solid lines are error function fits to the datapoint. The 
error bars on the datapoints were determined using boot- 
strap re-sampling and show 68 percent confidence levels. The 
error function fits and the completeness limits at 95% confi- 
dence levels are giv en in Table |3] Using the same method as 
IZwaan et alJ <l2004l) . different fitting functions were tested 
in order to fit the completeness as a function of two param- 
eters. The completeness of NHICAT as a function of S p (Jy) 
and Sint (Jy km s~ ) is also shown in Table [3] 

The completeness (C(S P )) limits at 95% confidence 
level is at 68 mjy for HICAT, while NHICAT's completeness 
at the same confidence level is 91 mjy. It appears that to 
first order, the completeness of S p scales with the noise level. 
However, it would be too simplistic to assume that the noise 
levels and source detection scale linearly . It should also be 
noted that cosmic variance has not been taken into account. 

3.2 Reliability of NHICAT 

The reliability of NHICAT was measured by re-observing a 
subsample of NHICAT sources in the narrowband mode at 
the Par kes Telescope. As w ith the reliability estimation of 
HICAT |z waan et alJl2004f) . a random sample representing 
the full range of NHICAT parameters was chosen, while giv- 
ing preference to NHICAT detections with low Sint and S p 
(generally with Sint < 8 Jy km s _1 and S p < 0.07 Jy). 

3.2.1 Narrowband observations 

In addition to calculating the reliability of NHICAT, the 
narrowband observations were also used to remove spuri- 
ous detections from the catalogue. As such, the less certain 
source detections (Flag=2) were observed as higher priority. 
In addition, definite source detections (Flag=l) were chosen 
randomly by the observer for observation. These narrow- 
band observations took place over 4 observing sessions from 
July 2003 to February 2005. 

A spectral resolution of 1.65 km s~ x at z — was 
used by observing with the narrow-band mode which con- 
sisted of 1024 channels over 8 MHz. Integration times 
were approximately 15 minutes for each source. The spe- 
cific details of the observing mode used can be found in 
IZwaan et alJ i2004l) . The narrowband observations were re- 
duced using the AIP S++ packages livedata and gridzilla 
jBarnes et alJl2001fl as with the narrowband observations of 
HICAT. 

The percentage of rejected sources in the NHICAT nar- 
rowband observations was greater than for HICAT. A likely 
explanation is that the signal-to-noise level is less in the 
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Figure 6. Completeness of NHICAT as a function of S p , W50 and S- ln t. 



Table 3. Completeness of NHICAT. 



Parameter 


Completeness Fit 


C=0.95 


S P (Jy) 


erf[18.5(S'p - 0.02)] 


0.095 


S int (Jy km s- 1 ) 


erf[0.1(S int - 1.0)] 


15.0 


Sp(Jy), Sint (Jy kms- 1 ) 


erf[20.0(S p - 0.005)]crf[0.14(5 int - 1.0)] 





north and thus source detection algorithm is less effective 
than in the south. 

A total of 857 sources were observed, of which 172 (20%) 
were rejected, compared with the narrowband observations 
of HICAT where less than 10% of the observed sources 
were rejected. Figure Q shows the peak flux distributions 
of the follow-up observations in both Northern and South- 
ern HIPASS. As can be seen from this figure, more sources 
were rejected at S pea k > 0.05 Jy in the Northern follow-up 
observations than in the Southern follow-up observations. 
Also, the percentage of sources with Speak > 0.05 observed 
in the Southern follow-up observations are higher than in the 
Northern observations which may also explain the difference 
in the percentage rejected. 

3.2.2 Reliability measure 

As explained in lZwaan et alJ (120041) . the reliability will im- 
prove when more uncertain sources are removed after the 
narrowband follow-up observations. Therefore we start by 
examining the original catalogue before unconfirmed sources 
were removed. The ratio of the number of confirmed sources 
(A^°nf) to the number of observed sources (N^ B ) is defined 
to be : 



T(S P ,W) 



N£ ni (s P ,w)/K£(s P ,w) 



(4) 



The reliability R as a function of a single parameter is the 
mean of T, weighted by the number of sources in each bin. 
For example the reliability as a function of peak flux (S p ) 
is: 



R(S P ) 



Z% =0 N(S P ,W)xT(S P ,W) 
Z^ N(S P ,W) 



(5) 
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Figure 7. Peak flux, Speak (Jy)j distributions of the follow-up 
observations in Northern HIPASS (left) and in Southern HIPASS 
(right). The distributions marked with lines on a 45 degree angle 
represent the population of observed and confirmed detections, 
while, the distributions marked with horizontal lines represent 
the population of non-detected sources. 



Similarly, R(W) and i?(Smt) can be measured by integrating 
over S p . 

Since 20% of the observed sources in the initial NHICAT 
have been rejected and removed from the catalogue, the re- 
liability of the catalogue has been improved by re-observing 
a subsample of the sources. 

An estimate of the expected number of real sources has 
to be made in order to calculate the reliability of NHI- 
CAT after the removal of unconfirmed sources. The expected 
number of sources is given by: 



A^exp cctrcal — rmcd ~h (T X -/V u n ob served ) 



(6) 
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Figure 8. Reliability of NHICAT as a function of S p , W50 an d Sint- The histograms show the S p , W50 and S ln t distributions of NHICAT 
sources. 
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Figure 9. Example of 3 data products available online at 'http://hipass.aus-vo.org> for source HIPASS J0030+02. Clockwise from 
top left: Integrated intensity map, Hi spectra and a position-velocity projection intensity map. 



where -/V uno b SC rvcd is the number of sources that have not 
been reobserved. 

T can now be redefined as : 

A^cxp ect real 



T(S P ,W) = 



(7) 



N(S P ,W) 

where N(S P , W) is the total number of sources in NHICAT 
excluding the unconfirmed sources. Equation can now be 
used to calculate the reliability of the final NHICAT. 

Figure^Jshows the Sp, W and S 1B _t distributions of NHI- 
CAT sources as well as the reliability of NHICAT as func- 
tions of S p , W and Si n t where the dashed lines are error 
function fits to the datapoint. It should be noted that 871 
sources, 1002 sources and 764 sources are plotted in the S p , 
W and Sint distributions, respectively. This is due to the fact 



Table 4. Reliability of NHICAT. 



Parameter Reliability Fit 



C=0.95 



S p (Jy) 



erf[58(S p - 0.012)] 0.036 



that the rest of the sources have S p and Si n t greater than 
the parameter ranges given in Figure 2] The error bars on 
the datapoints were determined using bootstrap re-sampling 
and indicate 68 percent confidence levels. The error function 
fits and the completeness limits at 95% confidence levels are 
given in Table 01 

It is interesting though to note that the 95% level of re- 
liability in NHICAT is lower (in S p and S int ) than the 95% 
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level of reliability in HICAT. This result can be attributed 
to the fact that a larger proportion of the original NHICAT 
have been re-observed in the narrowband follow-up observa- 
tions. 



4 SUMMARY 

In the northern extension of HIPASS, which covers the re- 
gion between declinations +2° < 8 < +25°30', 1001 extra- 
galactic sources were found and catalogued in NHICAT. In 
addition an extra source found with 8 slightly less than +2° 
(which was not detected in HICAT) has also been included 
into NHICAT. 

NHICAT has been found to be 95% complete at peak 
flux 95 mjy and at an integrated flux 15 Jy km s _1 . This 
catalogue is also reliable at a 95% level at peak flux 36 
mjy. The entire catalogue and source spectra will be made 
publicly-available online at (http://hipass.aus-vo.orgi. 
The catalogue parameters presented in the online archive are 
the sa me 33 parameters detailed in Table 4 of lMever et alJ 
i2004l) . An excerpt of the online catalogue is shown in Ta- 
ble The online archive also contains the detection spectra 
and integrated intensity maps in various projections. Exam- 
ples of these data products are shown in Figure^ In addition 
to the Aus-VO archive, the catalogue and spectra will also 
be submitted to the NASA/IPAC Extragalactic Database 
(NED). 
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Table 5. Excerpt from NHICAT. Note that all the velocities are cz and heliocentric. 



Name 


RA 


Dec 


v™** (km s- 1 ) 


v™ n (kms" 1 ) 


«Sf* (km O 


v™ ln (kms" 1 ) 


"mom (km S L ) 




v Sp (km s" 1 ) 


i> g sr (km s _1 ) 


«l g (km s _1 ) 


"cmb (km s _1 ) 


Via 


(km s 1 ) 


"hi (km s _1 ) 


"spcdo (km s _1 ) 




"spechi (km s _1 ) 


"mask (km S _1 ) 


Wgg" (km s- 1 ) 


W$ n (km s- 1 ) 


VK 2 ^ ax (kms" 1 ) 


W^ in (kms" 1 ) 


S p (Jy) 




Sint(>/3/km s _1 ) 


RMS (Jy) 


RMSciip (Jy) 


RMScubo (Jy) 




Cube 


Sigma (km s _1 ) 


Box size (') 




Comment 


Follow-up 


Confused 


Extended 










HIPASSJ0030+02 


00:30:00.6 


02:05:46 


762.0 


5509.0 




5347.5 


765.3 


5362.3 




5514.5 


5370.3 


5344.9 


5336.2 




5111.8 


5590.2 


4055.5 




6973.1 


5112,5590 


361.8 


73.5 




434.8 


434.8 


0.060 




13.5 


0.0065 


0.0049 


0.01209 




389 


12 


7 




1 


1 
















HIPASSJ0033+02 


00:33:44.3 


02:40:37 


4340.8 


4454.8 




4390.4 


4336.6 


4402.7 




4478.3 


4411.8 


4386.3 


4377.6 




4242.8 


4542.0 


2989.5 




5941.5 


4243,4542 


225.9 


94.6 




248.0 


248.0 


0.069 




9.5 


0.0080 


0.0073 


0.01222 




390 


12 


7 




1 


1 
















HIPASSJ0142+02 


01:42:28.4 


02:56:20 


2988.9 


1763.9 




1764.0 


2989.1 


1762.8 




1744.6 


1772.0 


1746.6 


1737.9 




1685.4 


1843.3 


321.3 




3221.2 


1685,1843 


80.6 


80.6 




109.9 


109.9 


0.112 




9.0 


0.0070 


0.0058 


0.01362 




392 


12 


7 




1 



















HIPASSJ0150+02 


01:50:15.2 


02:18:58 


1503.8 


1695.8 




1696.6 


1505.4 


1696.0 




1677.9 


1703.0 


1677.6 


1668.9 




1618.9 


1750.0 


255.4 




3127.1 


1619,1750 


73.0 


73.0 




94.9 


94.9 


0.053 




3.5 


0.0065 


0.0060 


0.01362 




392 


12 


7 




1 


1 
















HIPASSJ0249+02 


02:49:06.4 


02:08:27 


1024.6 


1104.7 




1103.6 


1023.3 


1104.5 




1105.4 


1113.8 


1088.4 


1079.7 




1047.8 


1167.3 


-324.4 




2536.0 


1048,1167 


58.6 


58.6 




81.0 


81.0 


0.945 




55.6 


0.2363 


0.0543 


0.01380 




394 


12 


7 




1 



















HIPASSJ0253+02 


02:53:48.6 


02:20:42 


3645.8 


6787.8 




-9999.0 


3634.9 


6730.6 




6833.6 


6739.1 


6713.7 


6705.0 




6532.9 


6924.2 


5129.9 




8319.0 


6533,6924 


349.4 


235.8 




0.000 


0.000 


0.037 




9.2 


0.0072 


0.0066 


0.01440 




395 


12 


7 




1 


1 
















HIPASSJ0254+02 


02:54:05.6 


02:57:22 


6731.0 


2952.5 




3039.7 


-9999.0 


3036.2 




2937.1 


3043.9 


3018.5 


3009.8 




2864.7 


3212.5 


1490.0 




4603.0 


2865,3212 


264.4 


93.4 




293.6 


293.6 


0.090 




17.1 


0.0075 


0.0063 


0.01440 




395 


12 


7 




1 



















HIPASSJ0259+02 


02:59:48.2 


02:45:38 


376.0 


2793.6 




2848.4 


-9999.0 


2844.4 




2762.3 


2850.4 


2825.0 


2816.4 




2668.9 


3025.0 


1516.0 




4384.8 


2669,3025 


242.6 


137.5 




307.0 


307.0 


0.088 




16.9 


0.0075 


0.0069 


0.01440 




395 


12 


7 




1 
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